AITopics | pretext task

CroCo: Self-Supervised Pre-training for 3DVision Tasks by Cross-View Completion

Neural Information Processing SystemsApr-24-2026, 19:33:43 GMT

Masked Image Modeling (MIM) has recently been established as a potent pretraining paradigm. A pretext task is constructed by masking patches in an input image, and this masked content is then predicted by a neural network using visible patches as sole input. This pre-training leads to state-of-the-art performance when finetuned for high-level semantic tasks, e.g.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Useful Facts

Neural Information Processing SystemsApr-24-2026, 10:32:24 GMT

A.1 Relation of Inverse Covariance Matrix and Partial Correlation For a covariance matrix of joint distribution for variables X,Y, the covariance matrix is The derivation comes from the following: Lemma A.1 (Conditional independence (Adapted from [34])). Notice for arbitrary function f, E[f(X)|Y] = EL[f(X)|φy(Y)] with one-hot encoding of discrete variable Y. Therefore for any feature map we can also get that conditional independence ensures: This thus finishes the proof for Lemma D.4. A.3 Technical Facts for Matrix Concentration We include this covariance concentration result that is adapted from Claim A.2 in [18]: Claim A.2 (covariance concentration for gaussian variables). Let X = [x1,x2, xn]> Rn d where each xi N(0,ΣX). Then for any given matrix B Rd m that is of rank kand is independent of X, with probability at least 1 δ10 over X we have 0.9B>ΣXB 1 n B>X>XB 1.1B>ΣXB. Let X = [x1,x2, xn]> Rn d where each xi is ρ2-sub-gaussian. Then for any given matrix B Rd m that is of rank kand is independent of X, with probability at least 1 δ10 over X we have 0.9B>ΣXB 1 n B>X>XB 1.1B>ΣXB. Let Z Rn k be a matrix with row vectors sampled from i.i.d Gaussian distribution N(0,ΣZ). Let P Rn n be a fixed projection onto a space of dimension d.

artificial intelligence, machine learning, xdown1, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

02e656adee09f8394b402d9958389b7d-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 10:32:21 GMT

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

bf121b033db3bac31c3193e8a0dcbf66-Paper-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 10:53:59 GMT

adaptation, domain adaptation, pretext task, (15 more...)

Neural Information Processing Systems

Country:

Asia > India (0.14)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

Neural Information Processing SystemsFeb-15-2026, 21:03:20 GMT

To answer this question, we begin by revisiting the forward procedure of ViTs. A sequence of positional embeddings (PEs) [51] is added to patch embeddings to preserve position information. Intuitively, simply discarding these PEs and requesting the model to reconstruct the position for each patch naturally becomes a qualified location-aware pretext task.

artificial intelligence, computer vision, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Heilongjiang Province > Daqing (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Representation Learning via Consistent Assignment of Views over Random Partitions

Neural Information Processing SystemsFeb-15-2026, 10:42:37 GMT

CARP learns prototypes in an end-to-end online fashion using gradient descent without additional non-differentiable modules to solve the cluster assignment problem. CARP optimizes a new pretext task based on random partitions of prototypes that regularizes the model and enforces consistency between views' assignments.

artificial intelligence, machine learning, representation, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
South America > Brazil (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)
Information Technology > Artificial Intelligence > Vision (0.93)
(2 more...)

Add feedback

Self-SupervisedLearningvia MaximumEntropyCoding

Neural Information Processing SystemsFeb-12-2026, 07:47:46 GMT

Self-supervised learning (SSL) aims to learn rich and meaningful representations without relying onhumanannotations.

artificial intelligence, machine learning, representation, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

c81e155d85dae5430a8cee6f2242e82c-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 03:22:19 GMT

dataset, learning, representation, (16 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SupplementaryMaterialsVIME: ExtendingtheSuccessofSelf-and Semi-supervisedLearningtoTabularDomain

Neural Information Processing SystemsFeb-9-2026, 02:59:49 GMT

Semisupervised learning uses the trained encoder in learning a predictive model on both labeled and unlabeleddata. Figure 3: The proposed data corruption procedure. Original feature matrix(X) consists of four samples xi,i = 1...,4, where each row/column represents a sample/feature, and the features in each sample are represented by the same color. In the experiment section of the main manuscript, we evaluate VIME and its benchmarks on 11 datasets(6genomics,2clinical,and3publicdatasets). The selected SNPs and the corresponding blood cell trait together form an independent labeled dataset.

artificial intelligence, dataset, machine learning, (12 more...)

Neural Information Processing Systems

Country: